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METHOD AND APPARATUS FOR ERROR DETECTION AND 

CORRECTION 

Field of the Invention 

The present invention relates to integrated circuit dynamic memories, 
5 and more specifically to methods of detecting and correcting errors in 
dynamic memories. 

Background of the Invention 

Typical dynamic memories store data as charges on storage capacitors. 
Over time the storage capacitors lose charge and eventually the memory loses 
10 data. Dynamic memory devices therefore require a refresh operation to 
maintain the charge on the storage capacitors and thus maintain data. The 
refresh operation typically involves sensing the charge on the storage 
capacitor and reinforcing the charge. 

In a dynamic memory such as a Dynamic Random Access Memory 
15 (DRAM), the refresh operation does not avoid all losses of data. Loss of data 
may be due to irreparable hardware failures, referred to as hard errors. 
DRAMs also experience transient loss of data, referred to as soft errors. The 
number of soft errors experienced per size of memory and over time is 
referred to as the Soft Error Rate (SER) or Failure In Time (FIT) rate. 

20 The system data integrity and reliability is often measured by the SER, 

The SER is responsive to a variety of design, manufacture, and application 
specific issues. The marginal design of hardware may effect the SER by not 
considering the most common causes of such failures. Similarly, operating 
conditions may tend to increase the SER. For example, poor regulation or 

25 intentional deregulation of power supplies compromises the ability to 

correctly sense the contents of the DRAM bit cells. Additionally, operation of 
the DRAM at high temperatures increases leakage of the storage cell affecting 
the ability of the cells to maintain the stored charge. Another consideration is 
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packaging, where problems relating to radioactivity, as well as other 
manufacturing and packaging issues effect the SER. Packaging materials that 
emit alpha particles and/or cosmic radiation are classically studied sources of 
SER problems in DRAMs. 

5 In general, DRAM design and manufacture balance SER requirements 

against cost, as reducing SER typically adds additional circuitry, reduces 
operating speed and increases power dissipation. As the DRAM industry 
moves to lower voltage operation, the stored charge of the storage cells is 
reduced, increasing SER, Additionally, advances in semiconductor process 
10 technology and economic pressure to reduce cost by increasing circuit density 
tend to result in reduction of the capacitance of DRAM storage capacitors. 
Reduced charge (Q = CV) stored in DRAM bit cells results in increased SER 
because of an inverse exponential relationship between stored charge and 
SER. 

15 Various methods have been developed to detect and correct soft and 

hard errors. An Error Detection And Correction unit (ED AC) is used to detect 
errors in stored data, and if possible, correct errors in the data. EDACs greatly 
improve data integrity. The operation of one type of ED AC is based on a code 
word. Data to be stored in the memory is provided to the ED AC. The ED AC 

20 then generates check bits based on the data value. The check bits are then 

combined with the data to form a code word. The code word is then stored in 
the memory. To check the data, the ED AC reads the code word from the 
memory and recalculates the check bits based on the data portion of the code 
word. The recalculated check bits are then compared to the check bits in the 

25 code word. If there is a match, the data is correct. If there is a difference and 
the error is correctable, the ED AC provides the correct data and check bits as 
an output. If there is a difference and the error is detectable but uncorrectable, 
the EDAC reports the occurrence of a catastrophic failure. 

A variety of EDAC techniques and circuits are available, as are a variety 
30 of methods for generating code words and performing bit checks. Some 
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methods are discussed in U.S. Patent No. 5,598,422, by Longwell, et ah, entitled 
"Digital computer having an error correction code (ECC) system with 
comparator integrated into re-encoder," and in Error-Correction Codes, by W. 
W. Peterson, 2d edition, MIT Press (1972). 

5 The codeword generated by an EDAC is dependent upon the size of the 

data component and the required level of detectability and correctability of 
errors. As the number of errors the EDAC is able to detect and the number of 
errors the EDAC is able to correct increase, the number of check bits, and thus 
the number of bits in the codeword increases. As the codeword increases so 
10 does the complexity of the EDAC . This complexity adds to the circuitry 

required and also reduces the speed of operation of the EDAC and therefore of 
the DRAM. 

•SPSS:, 

ft Therefore a need exists for an efficient method to detect and correct 

U errors in a dynamic memory. A need exists for an EDAC apparatus that 

ffj 15 reduces the SER even as the size of the memory cell shrinks, and over a broad 

!£? range of considerations, such as manufacturing, packaging, design, and 

a application. Further, a need exists to develop an integrated method of error 

m detection and correction for an embedded DRAM, where the DRAM includes 

an array of memory tiles. Still further a need exists to increase the capability 
y3 20 of error detection and correction without increasing the complexity of the 

individual EDAC units. 
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Brief Description of the Drawings 

The present invention may be more fully understood by a description of 
certain preferred embodiments in conjunction with the attached drawings in 
which: 

5 Fig. 1 illustrates in block diagram form a memory system having 

distributed error detection and correction (EDAC) units according to one 
embodiment of the present invention; 

Fig. 2 illustrates in block diagram form a memory system having 
distributed error detection and correction (EDAC) units according to an alternate 
10 embodiment of the present invention; 

Fig. 3 illustrates in block diagram form a memory system having a 
column-wise parity error detection circuit according to one embodiment of the 
present invention; 

Fig. 4 illustrates in block diagram form a memory system having 
15 orthogonal EDAC units according to one embodiment of the present invention; 

Fig. 5, comprising Figs. 5A and 5B, illustrates in tabular form one method 
of row address decode and code word assignment as implemented in Fig. 4; 

Figs. 6 and 7 illustrate in block diagram form portions of an orthogonal 
EDAC unit as in Fig. 4; 

20 Fig. 8, comprising Figs. 8A to 8D, illustrates in flow diagram form error 

detection and correction as in Fig. 4; and 

Fig. 9 illustrates in block diagram form a three-dimensional memory array 
comprising a stack of planes of rows and columns, each plane having respective 
row and column EDACs, and a stack EDAC. 

25 Detailed Description of the Preferred Embodiments 

Throughout this description the terms "assert" and "negate" will be used 
when referring to the rendering of a signal, status bit, or similar apparatus into 
its logically true or logically false state, respectively. Similarly, with respect to 
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information or data stored in a memory, a "zero" value is a low potential value, 
and a "one" is a high potential value. 

One embodiment of the present invention provides a means of reducing 
the complexity of a single, large ED AC by distributing the EDAC function across 
5 multiple, smaller EDAC circuits. This reduces the complexity of the EDAC 
calculation and increases the speed of operation and throughput. 

Another embodiment of the present invention extends the operating 
environment in which a DRAM can reliably store data. In addition to traditional 
word-wise EDAC units, the present invention provides a bit-wise error detection 

10 and correction capability that expands the range of errors detected and corrected. 
The bit- wise error detection and correction may incorporate a simple bit- wise 
parity calculation, which allows many double-bit word-wise errors to be 
detected and corrected in concert with a word-wise EDAC unit. The bit-wise 
method may be expanded to provide a full bit-wise EDAC, where the same bit of 

15 each word stored in the DRAM is used to calculate check bits which are stored in 
the bit-wise EDAC. If desired, the DRAM can be partitioned into two or more 
sub-arrays, either physical (e.g., contiguous) or logical {e.g., interleaved or 
interdigitated), and a separate bit-wise parity generator or EDAC provided for 
each partition, thereby reducing correction time and increasing coverage of 

20 related or correlated errors. In this way, multiple double-bit errors are 

correctable at low hardware cost. For convenience of reference, we will refer to a 
word-wise EDAC as an "WEDAC" and a bit-wise EDAC as a "BED AC". 

In one aspect of the present invention a memory system includes a 
memory having a plurality of dynamic memory cells, each cell storing a bit, an 

25 access circuit connected to the memory to access, during an access cycle, a 

selected set of the bits stored in said memory cells, and an error detection circuit 
connected to the access circuit and said memory to detect an error in a bit 
accessed during said access cycle. The error detection circuit includes a first 
error detection circuit to detect an error in a bit of a first subset of said accessed 

30 bits; and a second error detection circuit to detect an error in a bit of a second 
subset of said accessed bits. 
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In another aspect of the present invention, a memory system includes a 
memory having first and second groups of dynamic memory cells, each cell 
storing a bit, an access circuit connected to the memory and an error detection 
circuit. The access circuit is adapted to access, during an access cycle, a selected 
5 one of a first set of the bits stored in said first group of said memory cells, and a 
second set of the bits stored in said second group of said memory cells. The error 
detection circuit is connected to the access circuit and said memory to detect an 
error in a bit accessed during said access cycle. The error detection circuit 
includes a first error detection circuit to detect an error in a bit of said first set of 
10 accessed bits, and a second error detection circuit to detect an error in a bit of 
said second set of accessed bits. 

According to still another aspect of the present invention, a memory 
system includes a memory having a plurality of dynamic memory cells arranged 
in a plane of rows and columns, each cell storing a bit, an access circuit 

15 connected to the memory to access, during an access sequence, all of the bits 
stored in said plane of said memory cells, and an orthogonal error detection 
circuit connected to the access circuit and said memory to detect an error in a bit 
accessed during said access sequence. The orthogonal error detection circuit 
including a row error detection circuit to detect an error in a bit of a row of said 

20 accessed bits, and a column error detection circuit to detect an error in a bit of a 
column of said accessed bits. 

In one embodiment, the present invention, a circuit for use in a memory 
system includes a memory having a plurality of dynamic memory cells arranged 
in a plane of rows and columns, each cell storing a bit, an access circuit 

25 connected to the memory to access, during an access sequence, all of the bits 
stored in said plane, and a row error detection circuit connected to the access 
circuit and said memory to detect an error in a bit of a row of said accessed bits. 
The row detection circuit having a parity generation circuit connected to said 
memory to generate a parity bit related to all bits stored in a respective one of 

30 said columns. 

In another embodiment of the present invention, a random access error 
detection and correction (RAEDAC) unit for detecting and correcting errors in an 
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ordered bit string of predetermined length, includes a parity generation circuit 
which receives, in any order, each bit of said string, and generates a plurality of 
parity bits, each related to a unique combination of said bits comprising said 
string, an error detection circuit connected to said parity generation circuit to 
5 detect an error in a bit of said string using said parity bit, and an error correction 
circuit coupled to the parity check circuit to correct said detected bit error. 

FIG. 1 illustrates, in block diagram form, memory system 10 according to 
one embodiment of the present invention, where the memory system 10 
interfaces with, and forms a portion of, a conventional data processing system 

10 (not illustrated). The memory system 10 includes a memory controller 12 which 
transmits and receives data on a data bus 14. The data bus 14 is illustrated as 
having "x" conductors for simultaneously communicating x bits of data. The 
memory controller 12 receives the x bits of data via data bus 14 and distributes 
the data to a first WEDAC unit, WEDACO 16 and to a second WEDAC unit, 

15 WEDAC1 18. 

The memory controller 12 generates chip select signals, CSO and CS1, to 
selectively enable WEDACO 16 and WEDAC1 18, respectively. When a chip 
select signal is asserted, data is provided to the corresponding WEDAC unit. The 
chip select signals may also be referred to as WEDAC select signals. 

20 The memory controller 12 distributes the data by providing one portion to 

WEDACO 16 and another portion to WEDAC1 18. As illustrated in Fig. 1, 
WEDACO 16 receives a bits of data, where a is a predetermined number, usually 
a power of 2, whereas WEDAC1 18 receives the remaining (x - a) bits of data. 
Note that memory system 10 is an exemplar of a distributed EDAC memory 

25 system; alternate embodiments may include any number of EDAC units. 

According to one embodiment, each of the multiple WEDAC units 
receives an equal number of data bits, such as where x = 128, a = 64, and (x - a) = 
64. For multiple WEDAC units each receiving an equal number of data bits, the 
total data bits, x, is divided by the total number of WEDAC units to determine 
30 the number of data bits each WEDAC unit receives. According to an alternate 
embodiment, each WEDAC unit receives a predetermined number of data bits, 
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which are not necessarily equal. As more WED AC units are added, the memory 
controller 12 generates additional chip select signals. In systems having dynamic 
bus sizing, subsets of the multiple WED AC units may be selectively enabled on a 
cycle-by-cycle basis, depending upon the respective bus size. It may be 
5 advantageous to selectively enable a portion of the data bus through WEDACO 
16 while the rest of the data bus is not active. In general, the number of check 
bits required is sized according to the size of the active portion of the data bus 14. 

In general, the number of check bits is a function of the desired system 
integrity, as well as the number of data bits. A WED AC unit forms a code word 

10 by generating a plurality of check bits depending upon the value of a set of data 
bits. As illustrated in Fig. 1, the code word generated by WEDACO 16 will have 
(a + $) bits. The so-called "Hamming distance" between two valid code words is 
the number of bit positions in which the code words differ. In general, if two 
code words are a Hamming distance, d, apart, it will require d single-bit errors to 

15 convert one into the other. For single-bit error detection and correction, the 
number of check bits, s, is thus determined by the Hamming equation, given in 
our example as: 

(a + s + l)<2 s 

which allows code words to be separated such that single-bit errors do not 
20 convert one code word into another. This is derived from the fact that each of 

the 2 a valid code words handled by WEDACO 16 has (a + s) invalid code words 
at a distance of 1 from it, formed by inverting each of the (a + s) bits in the code 

word. Therefore, each of the 2 a valid code words is surrounded by a buffer 
zone comprising (a + s + 1) invalid bit patterns. The restraint is then reduced to 
25 the equation provided herein above. 

The generated code words are simultaneously provided to the dynamic 
random access memory (DRAM 20) by WEDACO 16 and WEDAC1 18. The 
WEDACO 16 adds a predetermined number of check bits, s, to the a data bits, 
while WEDAC1 18 adds a predetermined number of check bits, f, to the (x - a) 
30 data bits. The result is that (x + s + t) bits of information are provided to the 
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DRAM 20. All of the check bits calculated by WEDACO 16 and WEDAC1 18 are 
stored in the DRAM 20, together with the data bits. The check bits may be 
concatenated and stored at the end of all the data bits, or they may be distributed 
throughout the data bits in any desired manner. 

5 In one embodiment, memory controller 12 receives 128 data bits, and 

provides WEDACO 16 with 64 bits and WEDAC1 18 with 64 bits. Each WEDAC 
generates eight (8) check bits for each of the 64 bits of data. Note that in a 
memory system having only a single WEDAC unit, the original 128 bits of data 
would only require nine (9) check bits. Here the total for the two WEDAC units 
10 is 16 check bits. On the other hand, the circuitry of the smaller WEDAC units is 
less complex than that of a single large WEDAC unit, thus decreasing the 
computation time and circuit area for the WEDACs, as well as increasing 
throughput and scalability. 

In an alternate embodiment illustrated in Fig. 2, memory system 22 

15 includes two (or more) DRAM banks which are interleaved, allowing pipelined 
or partitioned operation, wherein one bank may be refreshed while the other is 
accessed for normal operation. As in the embodiment shown in Fig. 1, the 
memory controller 12 generates a separate chip select signal for each WEDAC 
unit. The WEDACO 16 receives a predetermined number, a, of data bits, while 

20 the WEDAC1 18 receives a predetermined number, b, of data bits, where x = a + b 
and a and b are not necessarily equal. Each WEDAC unit calculates an 
appropriate number of check bits according to the number of data bits received. 
The WEDACO 16 generates s check bits to form a code word having (a + s) bits 
for storage in the DRAMO 24. Similarly, WEDAC1 18 generates * check bits to 

25 form a code word having (b + t) bits, which is then stored in DRAM1 26. Note 
that chip select signals, CSO and CS1 are each coupled to DRAMO 24 and DRAM1 
26, respectively. In this way, the memory system 22 may "scrub" the DRAMO 24 
while the DRAM1 26 is in normal operating mode and vise versa. As one DRAM 
may be scrubbed in isolation, the scrubbing operation does not impact the speed 

30 or function of the other DRAM. Note that DRAMO 24 and DRAM1 26 may be 
banks or tiles within a single chip memory, or may be discrete memory devices. 
Such a tiled memory system and a method for scrubbing are disclosed in our 
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copending patent application entitled "METHOD AND APPARATUS FOR 
REFRESHING AND SCRUBBING A DYNAMIC MEMORY " by Longwell et al., 
having Attorney Docket No. JMS008-00, filed May 18, 1999, and assigned to the 
assignee hereof, which is expressly incorporated herein by reference. 

5 The distributed WEDAC aspects of the present invention allow the system 

designer to customize the WEDAC units to the particular characteristics of the 
memory serviced. The smaller size of the individual WEDAC units increases the 
speed of the overall operation. Note that while two (2) WEDAC units are 
illustrated in Figs. 1 and 2, any number of WEDAC units may be incorporated 
10 based upon relevant price, power and performance considerations. Furthermore, 
some portions of the memory may be covered by simple parity, while other 
portions may have no error detection capability at all. 

Fig. 3 illustrates a column-wise parity error detection method which may 
be implemented in combination with a WEDAC, such as those shown in Figs. 1 

15 and 2. In memory system 28, the DRAM 30 stores code words as concatenated 
strings of data bits 32 and check bits 34. A bit-wise parity detection unit 36 is 
coupled to the DRAM 30. The parity detection unit 36 includes a parity logic 
circuit 38 and a parity register 40. In operation, the parity logic circuit 38 
generates a set of parity bits for storage in the parity register 40, each of which is 

20 based upon the state of the set of corresponding bits in each of the code words 
stored in the DRAM 30. For example, in DRAM 30, the memory cells can be 
thought of as forming an array of rows and columns. Although multiple code 
words may be stored in a single row, it will be assumed for purposes of the 
following discussion that only a single code word (of length (N+l)) is stored in a 

25 single row. In such an arrangement, the DRAM 30 can be envisioned as a set of 
(N+l) columns, numbered 0 to N. Of course, the memory array may include any 
number of rows and any number of columns, and it is not necessary that the 
number of rows be equal to the number of columns. Generally, the width of the 
parity register will be equal to the width of the code word, although it may be 

30 less. 

Each parity bit is calculated by counting the number of ones (l's) in a 
given column. Parity may be calculated according to either an odd or an even 
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scheme, although odd parity is usually preferred. For an odd parity scheme, if 
the number of ones (l's) in the column is odd, the parity bit for that column will 
be zero (0), as the column already has odd parity. If the number of ones (l's) in 
the column is even, the parity bit for that column will be a one (1). The parity bit 
provides a method of identifying errors in a column without column specific 
information, such as which bits are ones (l's) and which bits are zeroes (O's). The 
column-wise parity is calculated based only on the total number of ones (l's) in 
the column. This allows the parity indicator for each column to be updated at 
the time of each write to memory, based on the "old" bit" (i.e., the respective bit of 
the code word previously written into that column), the "new" bit (i.e., the 
respective bit of the new code word which is going to be written into that 
column), and the current value of the corresponding parity indicator. 

According to one embodiment, at the time of power-on initialization a 
predetermined valid code word bit pattern is written to each word location in the 
DRAM 30. All parity bits in the parity register 40 may then be simultaneously 
initialized to an appropriate value, depending upon the selected parity scheme. 
Thereafter, the parity information stored in the parity register 40 will be updated 
as each successive write into the DRAM 30 is performed. According to this 
approach, the DRAM 30 starts operation in a fully valid state and all collateral 
operations, such as scrubbing, can proceed as usual. 

Alternatively, as a part of the initialization sequence, the current-contents 
(i.e., noise) may be read out and the parity logic circuit 38 allowed to calculate the 
current parity and store it into the parity register 40. Thereafter, the parity 
information will be updated as each of the actual code words are written into the 
DRAM 30. We consider this approach to be less desirable, however, since the 
contents of the DRAM 30 at initialization will, in general, not comprise valid 
word-wise code words, and special procedures will need to be provided either to 
generate (and rewrite) valid check bits 34 depending upon the current data bits 
32 or to selectively disable validation and/or scrubbing of invalid code words. 

Continuing now with Figure 3, the parity logic circuit 38 is coupled to a 
WED AC 42 which generates code words to be stored in a selected location in the 
DRAM 30. For a write operation, the WEDAC 42 receives new data bits and 
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generates /concatenates the appropriate check bits to produce the new code 
word. Simultaneously, the DRAM 30 is accessed to retrieve the old code word 
stored in the selected location. Thereafter, parity logic circuit 38 compares the 
new code word to the old code word to determine which, if any, of the parity bits 
in the parity register 40 need to be changed. If a particular bit of an old code 
word is different from the same bit of the new code word then the corresponding 
parity bit will need to be toggled as a result of the write. If the old and new bits 
are the same, the current state of that parity bit will still be correct after the write. 

Whereas the WEDAC 42 is designed for single-bit error correction and 
double-bit error detection, addition of the parity detection unit 36 facilitates 
correction of many double-bit errors detected in a retrieved code word. When 
the WEDAC 42 detects a single-bit error, operation of the parity logic circuit 38 is 
temporarily suspended while the WEDAC 42 fixes the single-bit error, so that the 
parity is maintained at the correct value. To eliminate any other single-bit errors, 
the WEDAC 42 may proceed to scrub the entire DRAM 30. When the WEDAC 
42 detects a multi-bit error, a sequencer (discussed below) employs the parity 
logic circuit 38 to recalculate, for all stored code words, the bit-wise parity bits, 
and then compares each recalculated parity bit to the corresponding bit stored in 
the parity register 40. This bit-wise recalculation operation, which we refer to as 
a "panic scrub", will access the entire parity space covered by the parity register 
40. The recalculated parity bits and the stored parity bits may then be exclusive- 
ORed to determine which bits are different. In this arrangement, each parity bit 
which is different, in general, indicates a error in the corresponding bit of the 
code word containing the multi-bit error. Using this bit-wise information, 
together with the word-wise information provided by the WEDAC 42, the 
sequencer can usually identify and correct the erroneous bits of the code word. 

Occasionally, when one double-bit error is detected, additional double-bit 
errors are present in the DRAM 30. For example, assume that two double-bit 
word-wise errors occur such that one particular bit of each of two different 
stored code words are both in error, i.e., one of the erroneous bits comprising 
each of the double-bit errors, say bit z, is "stacked" in a bit-wise sense. According 
to the parity logic circuit 38, the parity bit corresponding to bit z will still appear 
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to be valid. Furthermore, the parity detection unit 36 will be unable to assist in 
the identification and correction of the other erroneous bits, since insufficient 
information is available to decide which detected single-bit bit-wise error 
corresponds to which of the double-bit word-wise errors. Similar, unresolvable 
5 ambiguities exist when two double-bit word-wise errors occur such that each of 
the erroneous bits is in a different bit position of the respective code words, and 
when multiple stacked double-bit errors occur. It is, of course, quite feasible to 
physically partition the DRAM 30 into two or more distinct subarrays and 
provide a parity error detection unit for each. In such an arrangement, all 

10 double-bit word-wise errors can be detected and corrected so long as no more 
than one occurs simultaneously in the same subarray or "parity space". The odds 
of this happening can be improved by partitioning the DRAM 30 logically, rather 
than physically, in such a way as to insure that adjacent physical rows in the 
DRAM 30 lie in different logical parity spaces. One way to do this would be to 

15 use a selected subset of the address bits to distinguish between parity spaces, 
effectively interleaving or interdigitating the logical subarrays. Such 
interdigitating has been used in conventional WEDACs to greatly reduce the 
likelihood that a single noise event, e.g., an alpha particle, will corrupt multiple 
bits of the same code word. Since simple parity is inadequate to resolve 

20 multiple, double-bit errors within the same parity space, multiple interdigitated 
spaces can significantly improve data integrity in environments in which 
correlated multi-bit errors occur frequently. 

As illustrated in Fig. 4, our bit-wise error detection and correction 
methodology may be extended to combine a BED AC with a WED AC, such as 

25 those shown in Figs. 1 and 2, to provide greater data integrity than bit-wise 
parity in environments exhibiting significant levels of non-correlated (i.e., 
random) errors. In the memory system 44, a memory controller 46 controls the 
operation of the DRAM 30, and a memory interface 48 controls requests for 
access to the DRAM 30 by an external source, such as a processor (not shown). 

30 Coupled to DRAM 30 is a decoder 50 which decodes the row address 

information to access (N+l)-word-wise code words stored at respective storage 
locations within DRAM 30. By way of example, the memory cells within DRAM 
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30 are organized in rows, numbered 0 to 7, and columns, numbered 0 to N. A 
code word is stored in each row, across the data bits 32 and the check bits 34. 
The memory system 44 also includes a WEDAC 52 which receives new data from 
the memory interface 48 and provides appropriate code words to the DRAM 30. 
5 Simultaneously, each code word is provided to a BEDAC 54, which sequentially 
generates a bit-wise code word for each column in the DRAM 30. For 
convenience of visualization, the bit-wise code words are shown stored in a 
columnar fashion, wherein each "column" in the BEDAC 54 stores the several bit- 
wise check bits for an associated column in the DRAM 30. As was the case for 
10 the word-wise WEDAC 52, the bit-wise check bits, together with the data bits in 
a respective column in the DRAM 30 can be thought of as comprising a bit-wise 
code word. As discussed above, the column height can be logically partitioned 
to provide interleaved or interdigitated parity spaces. 

According to the design of one embodiment, the bit-wise check bits 

15 (BCBs) are stored in the rows of the BEDAC 54 as illustrated in Fig. 4, wherein 
the first row of the BEDAC 54 stores BCB0 for each column of the DRAM 30, the 
second row of the BEDAC 54 stores BCB1 for each column of the DRAM 30, and 
so on. As in the WEDAC 52, the error correction algorithm per se implemented 
by the BEDAC 54 can be quite conventional. As is common in such algorithms, 

20 and unlike simple parity, each of the bits comprising each code word in the 
DRAM 30 affects more than one of the respective set of BCBs. For example, 
according to one possible scheme shown in Fig. 5A, the word-wise code word 
bits stored in row 0 in the DRAM 30 affect BCB0, BCB1, and BCB2 of the 
respective bit-wise code word stored in the BEDAC 54, the row 1 code word bits 

25 effect BCB0, BCB1 and BCB3, and so on. To implement such a scheme, the 
BEDAC 54 is coupled to a BEDAC decoder 56 which enables the appropriate 
BCBs according to the decoding table shown in Fig. 5B, wherein RA[2-0] 
comprise the same row address select signals provided by the memory controller 
46 during an access to the DRAM 30. In accordance with our invention, the 

30 BEDAC 54 effectively implements a classic "block error detection and correction" 
code, not in the traditional word-parallel manner, but bit-serially, i.e., in 
whatever order the "data" bits happen to appear, including purely random. We 
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refer to our method of operation as "random access error detection and 
correction" (RAEDAC) to distinguish it from a traditional word-wise error 
detection and correction algorithm applied to a serial data stream in which the 
order of bit presentation is predetermined and cannot vary without invalidating 
5 the algorithm. Before we show how the BEDAC 54 cooperates with the WEDAC 
52 to significantly enhance the ability of the memory system 44 to detect and 
correct random, multiple, double-bit errors, we will describe the WEDAC 42 in 
greater detail. 

As shown in Fig. 6, for each column in the DRAM 30, the BEDAC 54 has 

10 two sets of five (5) toggle flip-flops or "T-flops", one set for each of the BCBs: a 
normal set (Tn[0-4]) which is active during normal operation, and a scrub set 
(Ts[0-4]) which is active only during a scrubbing operation. In such an 
arrangement, each pair of sets of T-flops are associated with a respective one of 
the (N+l) bits comprising a word-wise code word. At the end of a double-bit 

15 error scrub operation, all of the T-flops in the respective set (i.e., either the normal 
or scrub set for a given code word bit location) are simultaneously coupled to a 
corrupt bit select 58, wherein a shared syndrome calculator 60 calculates a 
syndrome. Each syndrome is then decoded by a respective syndrome decoder 62 
in a conventional manner to provide information regarding the number of bits in 

20 error (one, two, or even three, depending upon the selected algorithm), and, if 
only one bit is in error, the row address corresponding to the particular bit in the 
bit-wise code word that is in error. When considered collectively, the set of all 
bit-wise syndromes generated by the BEDAC 54 indicate which of the "columns" 
in the DRAM 30 have bits in error. Similarly, the syndromes generated by the 

25 WEDAC 52 indicate which particular "rows" in the DRAM 30 have bits in error. 
By combining the word-wise and the bit-wise error information, it is possible in 
many cases to unambiguously identify those row/column intersections at which 
the bit errors are present. Thus, as in the simple parity embodiment, all single 
double-bit word-wise errors can be detected and corrected. In addition, 

30 however, the BEDAC 54 can detect and correct all bit-wise single-bit errors, 

leaving any remaining stacked double-bit errors (now, hopefully, only single-bit 
word-wise errors) to be cleaned up in a follow-up scrub operation. Furthermore, 
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using the BEDAC 54, single occurrences of double stacked double-bit word-wise 
errors can be detected and corrected since such a unique error pattern can occur 
if and only if the bits located at all four of the row/column intersections are in 
error. 

5 By way of example, a typical T-flop, Tx, illustrated in Fig. 7, comprises a 

first D latch 64 and a second D latch 66, arranged in a classic toggle configuration 
with the Q output of the first D latch 64 coupled to the D input of the second D 
latch 66, and the D input of the first D latch 64 coupled to the Q output of the 
second D latch 66 via an inverter 68. An exclusive-Or gate (XOR gate 70) receives 
10 the new data bit to be stored in a memory cell of the DRAM 30, and the old data 
bit retrieved from that memory cell. So long as the old and new data bits match, 
the output of XOR gate 70 is a zero (0); when they differ, the output is a one (1). 
D The output of XOR gate 70 is input into an OR gate 72, together with a write 

S error signal. If either the old and new data values are different or the write error 
W 15 signal is asserted, the output of OR gate 72 is a one (1), and otherwise is a zero 
ij (0). ). As will be described below, the write error signal is used to correct the 

W contents of the T-flop when an error has been detected in the old code word. 

b_ On the occurrence of a write error or different old and new data bits, a one 

S (1) output from the OR gate 72 and input to AND gate 74 together with a 

t 20 respective one of the BCBS signals, BCBS[i] (described above in conjunction with 
5 Fig. 5). When the BCBS[i] signal is asserted and the output of OR gate 72 is a one 

* (1), AND gate 74 applies to the clock (C) input of the first D latch 64 a one (1), 

this occurs when there is either a write error or a difference has been detected 
between the old and new data bits. The AND gate 74 applies a zero (0) to the 
25 clock (C) input otherwise. Simultaneously, the BCBS[i] signal, inverted by an 
inverter 76, is applied to the clock (C) input of the second D latch 66. In this 
arrangement, the T-flop is selectively toggled in response to each assertion of the 
BCBS[i] signal, depending upon the output of OR gate 72. 

To facilitate system initialization, each D-flip-flop has an asynchronous 
30 initialization input, I, which is coupled to a Set/Clear (SET/ CLEAR) signal, the 
logic state of which depends upon the location of the respective T-flop in the 
BEDAC 54, as shown, for the given example, in the left-most column in Fig. 5A. 
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In this example, an odd parity scheme is being used so if an odd number of rows 
effect the BCBS signal the corresponding T-flop is cleared, and if an even number 
of rows effect the BCBS signal, the corresponding T-flop is set. In this way, the 
BCBSO for each column is set, the BCBS1 for each column is cleared, etc. Since the 
5 initialization scheme will be determined at design time, the SET/ CLEAR signal 
for each T-flop can be assigned the appropriate logic value in hardware. 

Referring now once again to Fig. 4, the memory system 44 also includes a 
sequencer 78 the state logic of which is selected to deal with the several forms of 
errors which the WEDAC 52 and BEDAC 54 are capable of detecting and/or 

10 correcting. One possible method for operating the sequencer 78 is illustrated in 
flow diagram form in Fig. 8, which comprises Figs. 8A through 8D. At 80, the 
DRAM 30 is initialized and the T-flops of the BEDAC 54 are set or cleared 
according to the selected, word-wise code bit assignments. At 82, the sequencer 
78 loops, waiting for a memory access. At 84, if the access is a refresh operation 

15 with no scrub operation, there is nothing for the sequencer 78 to do, so 

processing simply loops back to 82 after refreshing the DRAM 30. At 86, if the 
access is a refresh with scrub, then, at 88, single-bit errors are scrubbed using just 
the WEDAC 52. At 90, if no multi-bit errors are discovered, then processing 
returns to 82 to await a next access. However, if there is a row with a multi-bit 

20 error, processing continues at B, detailed in Fig. 8C. 

If, at 92, the access is a read, then the DRAM 30 is read, at 94. At 96, if no 
error is detected by the WEDAC 52, the read access is complete and processing 
returns to 82 to wait for the next access. If, at 98, no multi-bit error is detected, 
then, at 100, the single-bit error is fixed, completing the read access, and 

25 processing returns to 82 to await the next access; otherwise, processing continues 
at B, detailed in Fig. 8C. If, at 92, the access is not a read, and therefore must be a 
write, processing continues at A, which is detailed in Fig. 8B. 

Continuing with the write access, from A in Fig. 8B, the old code word is 
read from the DRAM 30, at 102. At 104, the new code word, which has already 

30 been generated by the WEDAC 52, is written into the DRAM 30 in order to 
minimize access cycle time in the usual case of no error. At 106, the bit-wise 
difference between the new and old code words is calculated to detect errors in 
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the accessed code word. At step 108, the T-flops of the BEDAC 54 are adjusted 
based on the values of each bit in the accessed code word according to the 
scheme of Fig. 5A, where each row effects multiple BCBs. Simultaneously, the 
old code word is forwarded to the WED AC 52 for error checking. If, at 110, the 
5 WEDAC 52 detects no error in the old code word, then processing returns to D to 
await the next access. On the other hand, if, at 112, a single-bit error is detected 
in the old code word, the BEDAC 54 corrects the appropriate T-flops at 114, and 
processing returns to D. If, at 112, a multi- bit error is detected in the old code 
word, then, at 116, the old code word is written back into the DRAM 30. At 118, 

10 the bit-wise difference of the old and new code words is then calculated, so that, 
at 120, the T-flops of the BEDAC 54 can be readjusted accordingly, thus restoring 
the system to the state that existed just prior to the current write access. Of 
course, an extra set of "shadow" latches (not shown) might be provided to 
selectively capture the information in the old and new sets of T-flops, thereby 

15 facilitating later state restoration. Processing then continues at B, detailed in Fig. 
8C. 

Illustrated in Figs. 8C and 8D is a special "panic" scrub subroutine that is 
invoked only in the event that a double-bit error is detected in the course of 
performing a normal read/write/refresh/scrub operation. During this 

20 operation, all of the code words in the DRAM 30 which are included in the parity 
space covered by the BEDAC 54 are accessed and checked for errors. At 122, the 
T-flops in the BEDAC 54 are set or cleared according to the predetermined bit- 
wise code bit assignments, such as in the example shown in Fig. 5A. After 
initialization, at 124, a first one of old code words, say code word 0, is read from 

25 DRAM 30 and, at 126, the BEDAC 54 recalculates the bit-wise check bits using 
the Ts-flops. At 128, if more old code words remain to be read, processing loops 
back to 124. After all of the old code words have been read and the bit-wise 
check bits recalculated, the corrupt bit select 58 generates a bit- wise syndrome, at 
130, using the shared syndrome calculator 60. So long as the syndrome decoder 

30 62 detects no errors, at 132, and there are bits remaining to be checked, at 134, 
processing loops back to 130 to continue sequencing through all bit locations 
until no bits remain to be checked, at which point processing returns to D to 
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await a next access. If, at 132, an error is detected, but it is determined, at 136, 
that it is only a single-bit bit-wise error, then, at 138, the error is fixed and 
processing rejoins the loop at 134. If, at 136, a multi-bit error is detected, 
processing continues at C, detailed in Fig. 8D. 

5 Continuing from C in Fig. 8D, at 140, the sequencer 78 determines if a 

predetermined stop criteria is satisfied. According to one embodiment, the 
predetermined stop criteria is a fixed number of times through the multi-bit 
correction process starting at 122. For example, the sequencer 78 may initialize a 
loop counter (not shown), at 122, and increment/ decrement it, at 122, until the 
10 counter reaches a predetermined maximum/ minimum value, at 140. 

Alternatively, the sequencer 78 may initially clear a loop control flag (not 
shown), at 80, set it each time a bit error is successfully corrected, as at 88 and 
O 138, and then check it (and clear it also), at 140, to determine if at least one error 

jp was fixed the last iteration through the process loop. Failure to fix at least one bit 

ti 15 error per loop iteration would constitute the stop criteria. In any event, if the 
J stop criteria is not satisfied at 140, processing returns to E. If, at 142, there are no 

W more errors, processing returns to D to finish the current access, if necessary, and 

!f await the next access. If errors still remain, then, at 144, the sequencer 78 

C- 1 determines if there are exactly four (4) bits still in error. The four-error case is a 

20 special circumstance which can occur only when there are exactly two stacked 
5 double-bit errors. If four errors are indeed detected, then, at 146, the sequencer 

3 78 can fix all four by reading out the two words that the WEDAC 52 indicates 

have double-bit word-wise errors and inverting just those bits that the BED AC 54 
indicates have double-bit bit-wise errors. Processing then returns to D to await 
25 the next access. If, at 144, this special four-error case is not detected, the memory 
system 44 has experienced a catastrophic breach of data integrity which cannot 
be repaired, since the information provided by the orthogonal WEDAC /BED AC 
combination is insufficient to identify the remaining bits in error.. Accordingly, 
at 148, upon detecting an uncorrectable error, a special, uncorrectable error flag 
30 (not shown) is raised to indicate that the information stored in the DRAM 30 is 
unreliable and should not be used, after which the sequencer 78 stops, at 150. 
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Returning now to Fig. 8A, at D, upon repairing any errors detected in the 
course of a read or a write access, the sequencer 78 completes the access, 152, and 
rejoins the main flow, at 82, to await the next access. Although we have 
illustrated what we believe to be a reasonable flow for the sequencer 78, many 
5 variations can be envisioned. The primary goal of all such flows is to efficiently 
detect and correct as many errors as possible, in view of the inherent limitation of 
the instantiated error detection and correction hardware as discussed herein, and 
to selectively report all uncorrectable errors. In achieving this goal, attention 
should be paid to minimizing the duration of time during which the DRAM 30 
10 will be unavailable for normal operation. As experience is gained with a DRAM 
30 manufactured in a particular technology, packaged in a selected medium and 
operated in a specific environment, the steps or states in the flow for the 
sequencer 78 can be rearranged or modified in accordance with our invention in 
3 order to optimize, for example, the overall reliability and data integrity of the 

J^f 15 DRAM 30, the area cost, or the time to perform a column-wise scrub. 

J: Depending upon the severity of a multi-bit error, the sequencer 78 may 

W not be required to scrub the entire parity space. For example, assume that a 

f j double-bit error is detected by the WED AC 52 in the code word retrieved from 

O row 2 of the DRAM 30. From the code table of Fig. 5A, it can be seen that each 

%1 20 bit of a code word stored in row 2 affects only three of the five column check bits: 

CCB0, CCB1 and CCB4. In order to fully verify the parity of, for example, just 
2 CCB4, then it is sufficient to examine the bits stored in only three of the other 

seven rows: row 4, row 5, and row 7. If, in the course of retrieving and checking 
the code words in these rows, no other word-wise multi-bit error is detected 
25 (remember that all single-bit errors can be immediately corrected by the WED AC 
52), then sufficient information is already available in the BED AC 54 to 
unambiguously identify the two bits in error in the code word retrieved from 
row 0. Similarly, it can be shown that all single occurrences of double-bit errors 
can be resolved by scrubbing around one-half of the parity space. On the other 
30 hand, since it is not assured that a second multi-bit error will not be detected in 
the minimal scrub space, the sequencer 78 should perform the panic scrub as if a 
second multi-bit error will indeed be detected, and then, if the assumption 
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proves incorrect, terminate the scrub early, as soon as the minimal scrub space 
has been examined. The information necessary to determine for each code word 
which other code words comprise the minimal scrub space is embodied, for 
example, in each of the respective CCBS[i] columns in Fig. 5B, and can easily be 
5 extracted using convention hardware techniques. In general, it can be shown 
that, for larger parity spaces and well-balanced encoding algorithms, the 
minimal scrub space approaches one-third of the parity space. 

Even though we have disclosed several preferred embodiments of our 
invention, we recognize that many other embodiments are not only possible but, 

10 perhaps, more desirable in specific circumstances. However, such alternate 

embodiments will, in general, employ variations or combinations of the disclosed 
embodiments. For example, as we noted above, it may be desirable in some 
applications to partition, logically or physically, the DRAM 30 by either 
rows/ words and columns /bits, or both, and provide separate and distinct 

15 WEDAC /BED AC units for each distinct coverage space. As we also explained 
above, increased error coverage can be obtained by interdigitating the coverage 
spaces, either by rows/words and columns /bits, or both. Of course, depending 
upon the reliability, cost and performance requirements of the memory system in 
particular application, combinations of no error coverage, parity only, or full 

20 ED AC, by rows/ words and columns /bits, or both, may be appropriate. 

Although we believe that our two-dimensional WEDAC/BEDAC 
embodiments are sufficiently robust to handle reasonably anticipated FIT rates 
for the memory densities available using present and near term semiconductor 
manufacturing/packaging technologies, we also anticipate that future 

25 technology improvements and the ever increasing demand for more system 
memory will raise FIT rates to the point where a two-dimensional instantiation 
may be inadequate. At such time, our RAEDAC invention can be easily 
reapplied to create a three-dimensional solution. For example, as shown in Fig. 
9, a very large memory array may be partitioned into a "stack" of, say (p+1), 

30 memory planes, each of which consists of the familiar planar array of rows and 
columns. In such an instantiation, each plane would have a respective 
WEDAC[0-p] for dealing with word-wise errors in the data array, and a 
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BEDAC[p-hl] for assisting an associated sequencer (not shown) in dealing with 
bit-wise errors, either in the data array or the dynamic check bit portion of the 
WED AC, as discussed above. In addition, however, a stack-wise EDAC or 
"SEDAC" (and associated sequencer, similar to that discussed in regard to Fig. 8) 
5 is provided to assist in dealing with stack-wise errors in the data and WED AC 
check bit arrays. Note that the SEDAC coverage could be extended to include 
the BEDAC check bit arrays in appropriate circumstances. It can be shown that 
many of the multi-bit error scenarios which are intractable for a two-dimensional 
instantiation become readily resolvable with the additional information available 

10 from the SEDAC. Of course, it will be readily apparent that higher-dimensional 
embodiments are certainly possible. In general, the degree to which the memory 
is partitioned into distinct parity spaces depends upon a number of interrelated 
issues, including price, performance, power, data integrity criticality, and the 
like. Similar considerations are relevant to the question of which form the 

15 RAEDACs should take: simple parity, multi-parity, interdigitated multi-parity, 
or full EDAC. 

Thus it is apparent that there has been provided, in accordance with the 
present invention, a method for providing distributed EDAC operation which 
reduces the complexity of the individual EDAC units and increases the speed of 

20 calculation. The present invention, in particular, provides a random access 

EDAC which is capable of assisting in the detection and correction of errors not 
accessible by a traditional one-dimensional EDAC unit. The combination of a 
WEDAC and a BEDAC allows more errors to be detected and corrected and 
reduces the instances of catastrophic data loss. Additional protection can be 

25 obtained using higher-dimensional RAEDACs, such as a SEDAC. Those skilled 
in the art will recognize that modifications and variations can be made without 
departing from the spirit of our invention. Therefore, it is intended that our 
invention encompass all such variations and modifications as fall within the 
scope of the appended claims. 
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Claims 

What we claim is: 

1. A memory system comprising: 

a memory comprising a plurality of dynamic memory cells, each cell 
5 storing a bit; 

an access circuit connected to the memory to access, during an access cycle, 
a selected set of the bits stored in said memory cells; 

an error detection circuit connected to the access circuit and said memory 
to detect an error in a bit accessed during said access cycle, comprising: 

10 a first error detection circuit to detect an error in a bit of a first subset 

of said accessed bits; and 

a second error detection circuit to detect an error in a bit of a second 
subset of said accessed bits. 

2. The memory system of claim 1 wherein said first and second error 
15 detection circuits also correct said bit errors, respectively. 

3. The memory system of claim 1 wherein said first and second subsets of 
said accessed bits are comprised of equal numbers of said bits. 

4. The memory system of claim 1 wherein said first and second subsets of 
said accessed bits are comprised of different numbers of said bits. 

20 5. The memory system of claim 1 wherein the memory system comprises 
an integrated circuit. 



23 



Attorney Docket No.: JMS009-00 



6. A memory system comprising: 

a memory comprising first and second groups of dynamic memory cells, 

each cell storing a bit; 
an access circuit connected to the memory to access, during an access cycle, 

a selected one of: 

a first set of the bits stored in said first group of said memory cells; and 

a second set of the bits stored in said second group of said memory 
cells; 

an error detection circuit connected to the access circuit and said memory 
to detect an error in a bit accessed during said access cycle, comprising: 

a first error detection circuit to detect an error in a bit of said first set of 
accessed bits; and 

a second error detection circuit to detect an error in a bit of said second 
set of accessed bits. 

7. The memory system of claim 6 wherein said first and second error 
detection circuits also correct said bit errors, respectively. 

8. The memory system of claim 6 wherein said first and second subsets of 
said accessed bits are comprised of equal numbers of said bits. 

9. The memory system of claim 6 wherein said first and second subsets of 
said accessed bits are comprised of different numbers of said bits. 

10. The memory system of claim 6 further including: 

a scrub circuit connected to the memory to scrub, during a scrub cycle, a 
selected one of: 

said first set of the bits stored in said first group of said memory cells; 
and 
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said second set of the bits stored in said second group of said memory 
cells. 

11. The memory system of claim 10 wherein said access cycle and said 
scrub cycle are non-overlapping. 

5 12. The memory system of claim 10 wherein said access cycle overlaps said 
scrub cycle, and wherein, during said access cycle, said access circuit accesses 
said selected one of said first and second sets of bits, and, during said scrub 
cycle, said scrub circuit scrubs the other of said selected first and second sets of 
bits. 

10 13. The memory system of claim 10 wherein the memory system 
comprises an integrated circuit. 
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14. A memory system comprising: 

a memory comprising a plurality of dynamic memory cells arranged in a 
plane of rows and columns, each cell storing a bit; 

an access circuit connected to the memory to access, during an access 
sequence, all of the bits stored in said plane of said memory cells; 

an orthogonal error detection circuit connected to the access circuit and 
said memory to detect an error in a bit accessed during said access 
sequence, comprising: 

a row error detection circuit to detect an error in a bit of a row of said 
accessed bits; and 

a column error detection circuit to detect an error in a bit of a column 
of said accessed bits. 

15. The memory system of claim 14 wherein said first and second error 
detection circuits also correct said bit errors, respectively. 

16. The memory system of claim 14 wherein: 

said memory comprises a plurality of said planes of rows and columns of 
said memory cells, the corresponding memory cells of each plane 
forming respective stacks; 

said access circuit accesses, during said access sequence, all of the bits 
stored in all of said planes of said memory cells; and 

said orthogonal error detection circuit further comprises: 

a stack error detection circuit to detect an error in a bit of a stack of 
said accessed bits. 

17. The memory system of claim 16 wherein the stack error detection 
circuit comprises a parity check circuit. 

18. The memory system of claim 16 wherein the stack error detection 
circuit comprises a RAEDAC unit. 
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19. The memory system of claim 14 wherein the column error detection 
circuit comprises a parity check circuit. 

20. The memory system of claim 14 wherein the column error detection 
circuit comprises a RAEDAC unit. 

21. The memory system of claim 14 wherein the row error detection circuit 
comprises a parity check circuit. 

22. The memory system of claim 14 wherein the row error detection circuit 
comprises an EDAC unit. 

23. The memory system of claim 14 wherein the memory system 
comprises an integrated circuit. 
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24. A circuit for use in a memory system comprising: 

a memory comprising a plurality of dynamic memory cells arranged in a 
plane of rows and columns, each cell storing a bit; 

an access circuit connected to the memory to access, during an access 
5 sequence, all of the bits stored in said plane; and 

a row error detection circuit connected to the access circuit and said 
memory to detect an error in a bit of a row of said accessed bits; 

the circuit comprising: 

a parity generation circuit connected to said memory to generate a parity 
10 bit related to all bits stored in a respective one of said columns. 

25. The circuit of claim 24 further comprising: 

a parity check circuit connected to said memory and to said parity 
generation circuit to detect an error in a bit of a column of said 
accessed bits using said parity bits. 

15 26. The circuit of claim 25 further comprising: 

an error correction circuit coupled to the memory and to the parity check 
circuit to correct said detected column bit error. 

27. The circuit of claim 24 wherein the parity generation circuit generates a 
plurality of check bits, each related to a unique combination of at least two of 

20 said bits stored in a respective one of said columns. 

28. The circuit of claim 27 further comprising: 

an error detection circuit connected to said memory and to said parity 
generation circuit to detect an error in a bit of a column of said 
accessed bits using said parity bits. 

25 29. The circuit of claim 28 further comprising: 
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an error correction circuit coupled to the memory and to the parity check 
circuit to correct said detected column bit error. 

30. The circuit of claim 29 further characterized as an integrated circuit, 
random access error detection and correction (RAEDAC) unit. 
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31. A random access error detection and correction (RAEDAC) unit for 
detecting and correcting errors in an ordered bit string of predetermined 
length, the RAEDAC comprising: 

a parity generation circuit which receives, in any order, each bit of said 
5 string, and generates a plurality of parity bits, each related to a unique 

combination of said bits comprising said string; 

an error detection circuit connected to said parity generation circuit to 
detect an error in a bit of said string using said parity bit; and 

an error correction circuit coupled to the parity check circuit to correct said 
10 detected bit error. 

32. The RAEDAC of claim 31 wherein the error detection circuit detects 
multi-bit errors in said string using said parity bits. 
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METHOD AND APPARATUS FOR ERROR DETECTION 
AND CORRECTION 

Abstract of the Disclosure 

A Random Access Error Detection and Correction unit (RAEDAC) that 
5 incorporates a bit-wise error detection and correction unit (BEDAC) in a memory 
system. In one embodiment, a word-wise error detection and correction unit 
(WED AC) operates in coordination with a BEDAC that performs a bit-wise 
parity calculation. In another embodiment, a WEDAC operates in coordination 
with a full bit-wise BEDAC that calculates bit-wise check bits. The RAEDAC 
10 may be applied to create a multi-dimensional EDAC where, for example, the 

memory is partitioned into a stack of planes, and a stack-wise error detection and 
correction unit (SEDAC) is implemented. 
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believe that I am an original, first and joint inventor of the subject matter which is claimed and for 
which a patent is sought on the invention entitled: 

Method and Apparatus for Error Detection and Correction 

TITLE OF INVENTION 
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REVIEW OF PAPERS AND ACKNOWLEDGMENT OF DUTY OF CANDOR 

I hereby state that I have reviewed and understand the contents of the above identified 
specification, including the claims, as amended by any amendment referred to above. 
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thereof, or patented or described in any printed publication in any country before my or our invention 
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sale in the United States of America more than one year prior to this application. 

I acknowledge the duty to disclose to the Patent and Trademark Office information which I 
know is material to the patentability of this application in accordance with Title 37, Code of 
Federal Regulations, § 1.56. 
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The invention claimed in the above-described specification has not been patented or made the 
subject of an inventor's certificate issued before the date of this application in any country foreign to 
the United States of America on an application filed by me or my legal representatives or assigns 
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POWER OF ATTORNEY 

I hereby appoint Jeffrey Van Myers (Reg. No. 27,362), Kent J. Cooper (Reg. No. 37,296), and 
Sandra L. Godsey (Reg. No. 42,589), who are registered to practice before the Patent and Trademark 
Office, as my attorney with full power of substitution and revocation, to prosecute this application, 
to make alterations or amendments therein, to receive the patent and transact all business in the 
Patent and Trademark Office connected therewith. 

All CORRESPONDENCE should be addressed to: 

J.V. Myers & Associates, P.C. 
7101 Highway 71 West Suite 214 
Austin, Texas 78735 
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true; and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under Section 1001 
of Title 18 of the United States Code, and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 
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